Beyond Reward: The Problem of Knowledge and Data

Author

  • Richard S. Sutton
Abstract

Intelligence can be defined, informally, as knowing a lot and being able to use that knowledge flexibly to achieve one's goals. In this sense it is clear that knowledge is central to intelligence. However, it is less clear exactly what knowledge is, what gives it meaning, and how it can be efficiently acquired and used. In this talk we reexamine aspects of these age-old questions in light of modern experience (and particularly in light of recent work in reinforcement learning). Such questions are not just of philosophical or theoretical import; they directly affect the practicality of modern knowledge-based systems, which tend to become unwieldy and brittle (difficult to change) as the knowledge base becomes large and diverse. The key question for knowledge-intensive intelligent systems is 'What keeps the knowledge correct?' and there have been essentially three kinds of answers: 1) people: human experts understand the knowledge and ensure that it matches their beliefs; 2) internal consistency: the system checks that its knowledge coheres and removes inconsistencies; and 3) grounding in data: the system compares its knowledge with external data in some way and changes it as needed to match the data. All of these are valid and often useful ways to maintain correct knowledge, but in practice, relying on people to maintain correctness has been the dominant approach, supplemented by checks for internal consistency. This approach is well suited to taking advantage of existing human expertise, but its reliance on people ultimately limits its ability to scale to very large knowledge bases. At the core of this approach is the view that knowledge is essentially public, describing a state of affairs in the world (separate from the intelligent system) that is at least potentially accessible to people. This might be called the public-knowledge approach.
In this talk we consider an alternative to the public-knowledge approach that is based on keeping knowledge correct by grounding it in data. Consider the case in which the data involved is the ordinary data available during the routine operation of the intelligent system without human intervention. This is the case of greatest interest because in it the system can correct and learn its knowledge autonomously, enabling scaling to very large knowledge bases (see Sutton 2009, 2001). If the system were a robot, this data would be simply whatever data was available through its sensors and about its motor actions. Knowledge grounded in such sensorimotor data may have no public semantics; …




Publication date: 2011